Dataset statistics
| Number of variables | 13 |
|---|---|
| Number of observations | 114 |
| Missing cells | 16 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 36 |
| Duplicate rows (%) | 31.6% |
| Total size in memory | 11.7 KiB |
| Average record size in memory | 105.1 B |
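The percentages and the average record size in this table follow from the raw counts. A minimal arithmetic sketch, assuming the reported 11.7 KiB is a rounded figure and that "cells" means rows times variables (variable names here are illustrative, not part of the report):

```python
# Counts copied from the "Dataset statistics" table.
n_rows, n_cols = 114, 13
missing_cells = 16
duplicate_rows = 36
total_bytes = 11.7 * 1024  # 11.7 KiB, already rounded in the report

# Share of missing cells over all rows x variables.
missing_pct = 100 * missing_cells / (n_rows * n_cols)
# Share of rows flagged as duplicates.
duplicate_pct = 100 * duplicate_rows / n_rows
# Average bytes per row.
avg_record_bytes = total_bytes / n_rows

print(round(missing_pct, 1), round(duplicate_pct, 1), round(avg_record_bytes, 1))
```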
Variable types
| Numeric | 12 |
|---|---|
| Unsupported | 1 |
Dataset has 36 (31.6%) duplicate rows | Duplicates |
occupiedDwellings is highly correlated with apartmentLessThanFiveStories and 1 other fields | High correlation |
singleDetached is highly correlated with averageNumRooms | High correlation |
semiDetached is highly correlated with averageValue | High correlation |
apartmentLessThanFiveStories is highly correlated with occupiedDwellings | High correlation |
apartmentMoreThanFiveStories is highly correlated with occupiedDwellings and 1 other fields | High correlation |
averageNumRooms is highly correlated with singleDetached and 2 other fields | High correlation |
averageValue is highly correlated with semiDetached | High correlation |
medianHouseholdIncome is highly correlated with averageNumRooms | High correlation |
occupiedDwellings is highly correlated with apartmentLessThanFiveStories | High correlation |
singleDetached is highly correlated with averageNumRooms | High correlation |
apartmentLessThanFiveStories is highly correlated with occupiedDwellings | High correlation |
apartmentMoreThanFiveStories is highly correlated with averageNumRooms | High correlation |
averageNumRooms is highly correlated with singleDetached and 2 other fields | High correlation |
averageValue is highly correlated with medianHouseholdIncome | High correlation |
medianHouseholdIncome is highly correlated with averageNumRooms and 1 other fields | High correlation |
occupiedDwellings is highly correlated with apartmentLessThanFiveStories | High correlation |
apartmentLessThanFiveStories is highly correlated with occupiedDwellings | High correlation |
averageNumRooms is highly correlated with medianHouseholdIncome | High correlation |
medianHouseholdIncome is highly correlated with averageNumRooms | High correlation |
df_index is highly correlated with occupiedDwellings and 8 other fields | High correlation |
occupiedDwellings is highly correlated with df_index and 9 other fields | High correlation |
singleDetached is highly correlated with df_index and 10 other fields | High correlation |
semiDetached is highly correlated with df_index and 7 other fields | High correlation |
rowHouses is highly correlated with df_index and 7 other fields | High correlation |
apartmentInDuplex is highly correlated with df_index and 8 other fields | High correlation |
apartmentLessThanFiveStories is highly correlated with df_index and 9 other fields | High correlation |
apartmentMoreThanFiveStories is highly correlated with df_index and 7 other fields | High correlation |
otherDwellings is highly correlated with df_index and 6 other fields | High correlation |
averageNumRooms is highly correlated with df_index and 10 other fields | High correlation |
averageValue is highly correlated with singleDetached and 2 other fields | High correlation |
medianHouseholdIncome is highly correlated with occupiedDwellings and 5 other fields | High correlation |
averageNumRooms has 4 (3.5%) missing values | Missing |
averageValue has 4 (3.5%) missing values | Missing |
medianHouseholdIncome has 4 (3.5%) missing values | Missing |
medianRent has 4 (3.5%) missing values | Missing |
medianRent is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
df_index has 3 (2.6%) zeros | Zeros |
semiDetached has 7 (6.1%) zeros | Zeros |
rowHouses has 23 (20.2%) zeros | Zeros |
apartmentInDuplex has 2 (1.8%) zeros | Zeros |
apartmentMoreThanFiveStories has 25 (21.9%) zeros | Zeros |
otherDwellings has 49 (43.0%) zeros | Zeros |
Reproduction
| Analysis started | 2021-12-06 18:09:44.540455 |
|---|---|
| Analysis finished | 2021-12-06 18:10:13.204710 |
| Duration | 28.66 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
df_index
| Distinct | 38 |
|---|---|
| Distinct (%) | 33.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.5 |
| Minimum | 0 |
|---|---|
| Maximum | 37 |
| Zeros | 3 |
| Zeros (%) | 2.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.65 |
| Q1 | 9 |
| median | 18.5 |
| Q3 | 28 |
| 95-th percentile | 35.35 |
| Maximum | 37 |
| Range | 37 |
| Interquartile range (IQR) | 19 |
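The quantiles above are consistent with linear interpolation between order statistics. The sketch below is an illustration rather than the report's own code: it assumes the df_index column holds each integer from 0 to 37 exactly three times (38 distinct values, 114 observations, which agrees with the frequency tables), and `quantile` is a hypothetical helper.

```python
def quantile(sorted_values, q):
    """Linear-interpolation quantile of an already sorted list, 0 <= q <= 1."""
    pos = q * (len(sorted_values) - 1)         # fractional index into the data
    lo = int(pos)                              # lower neighbour
    hi = min(lo + 1, len(sorted_values) - 1)   # upper neighbour
    frac = pos - lo
    return sorted_values[lo] + frac * (sorted_values[hi] - sorted_values[lo])

# Assumed reconstruction of df_index: 0..37, three copies of each value.
values = [v for v in range(38) for _ in range(3)]

p5 = quantile(values, 0.05)
q1 = quantile(values, 0.25)
median = quantile(values, 0.50)
q3 = quantile(values, 0.75)
p95 = quantile(values, 0.95)
iqr = q3 - q1
print(p5, q1, median, q3, p95, iqr)
```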
Descriptive statistics
| Standard deviation | 11.01427071 |
|---|---|
| Coefficient of variation (CV) | 0.5953659844 |
| Kurtosis | -1.201545472 |
| Mean | 18.5 |
| Median Absolute Deviation (MAD) | 9.5 |
| Skewness | 0 |
| Sum | 2109 |
| Variance | 121.3141593 |
| Monotonicity | Not monotonic |
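The descriptive statistics above can be cross-checked with Python's `statistics` module. This is a sketch under the assumption that df_index contains each integer from 0 to 37 three times, which is consistent with the sum (2109) and the frequency tables; the variance is the sample variance (divisor n - 1).

```python
import statistics

# Assumed reconstruction of df_index: 0..37, three copies of each value.
values = [v for v in range(38) for _ in range(3)]

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)   # sample variance, divisor n - 1
std = statistics.stdev(values)           # square root of the sample variance
cv = std / mean                          # coefficient of variation
median = statistics.median(values)
# Median absolute deviation: median distance from the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, variance, std, cv, mad)
```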
| Value | Count | Frequency (%) |
| 0 | 3 | 2.6% |
| 28 | 3 | 2.6% |
| 21 | 3 | 2.6% |
| 22 | 3 | 2.6% |
| 23 | 3 | 2.6% |
| 24 | 3 | 2.6% |
| 25 | 3 | 2.6% |
| 26 | 3 | 2.6% |
| 27 | 3 | 2.6% |
| 29 | 3 | 2.6% |
| Other values (28) | 84 |
| Value | Count | Frequency (%) |
| 0 | 3 | |
| 1 | 3 | |
| 2 | 3 | |
| 3 | 3 | |
| 4 | 3 | |
| 5 | 3 | |
| 6 | 3 | |
| 7 | 3 | |
| 8 | 3 | |
| 9 | 3 |
| Value | Count | Frequency (%) |
| 37 | 3 | |
| 36 | 3 | |
| 35 | 3 | |
| 34 | 3 | |
| 33 | 3 | |
| 32 | 3 | |
| 31 | 3 | |
| 30 | 3 | |
| 29 | 3 | |
| 28 | 3 |
occupiedDwellings
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 68 |
|---|---|
| Distinct (%) | 59.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1763.991228 |
| Minimum | 665 |
|---|---|
| Maximum | 3265 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 665 |
|---|---|
| 5-th percentile | 900 |
| Q1 | 1151.25 |
| median | 1707.5 |
| Q3 | 2251.25 |
| 95-th percentile | 2849.25 |
| Maximum | 3265 |
| Range | 2600 |
| Interquartile range (IQR) | 1100 |
Descriptive statistics
| Standard deviation | 647.5126608 |
|---|---|
| Coefficient of variation (CV) | 0.3670724948 |
| Kurtosis | -0.8340030349 |
| Mean | 1763.991228 |
| Median Absolute Deviation (MAD) | 557.5 |
| Skewness | 0.3076195824 |
| Sum | 201095 |
| Variance | 419272.6459 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1155 | 4 | 3.5% |
| 2430 | 4 | 3.5% |
| 2350 | 4 | 3.5% |
| 1095 | 4 | 3.5% |
| 900 | 4 | 3.5% |
| 1900 | 3 | 2.6% |
| 665 | 3 | 2.6% |
| 1695 | 3 | 2.6% |
| 2765 | 2 | 1.8% |
| 1875 | 2 | 1.8% |
| Other values (58) | 81 |
| Value | Count | Frequency (%) |
| 665 | 3 | |
| 870 | 1 | 0.9% |
| 900 | 4 | |
| 935 | 1 | 0.9% |
| 960 | 1 | 0.9% |
| 975 | 1 | 0.9% |
| 1010 | 1 | 0.9% |
| 1025 | 2 | |
| 1030 | 2 | |
| 1055 | 1 | 0.9% |
| Value | Count | Frequency (%) |
| 3265 | 2 | |
| 3045 | 2 | |
| 2885 | 2 | |
| 2830 | 1 | |
| 2770 | 2 | |
| 2765 | 2 | |
| 2655 | 2 | |
| 2605 | 1 | |
| 2565 | 1 | |
| 2530 | 1 |
singleDetached
| Distinct | 71 |
|---|---|
| Distinct (%) | 62.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 563.9813596 |
| Minimum | 5 |
|---|---|
| Maximum | 1885 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 5 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 380 |
| median | 557.3 |
| Q3 | 753.26625 |
| 95-th percentile | 1052.17125 |
| Maximum | 1885 |
| Range | 1880 |
| Interquartile range (IQR) | 373.26625 |
Descriptive statistics
| Standard deviation | 360.725422 |
|---|---|
| Coefficient of variation (CV) | 0.6396052207 |
| Kurtosis | 2.084772348 |
| Mean | 563.9813596 |
| Median Absolute Deviation (MAD) | 194.9575 |
| Skewness | 0.7743304359 |
| Sum | 64293.875 |
| Variance | 130122.8301 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 525 | 6 | 5.3% |
| 820 | 4 | 3.5% |
| 435 | 4 | 3.5% |
| 700 | 4 | 3.5% |
| 1005 | 2 | 1.8% |
| 245 | 2 | 1.8% |
| 115 | 2 | 1.8% |
| 880 | 2 | 1.8% |
| 750 | 2 | 1.8% |
| 470 | 2 | 1.8% |
| Other values (61) | 84 |
| Value | Count | Frequency (%) |
| 5 | 2 | |
| 9.63 | 1 | |
| 10.15 | 1 | |
| 10.675 | 1 | |
| 15 | 2 | |
| 20 | 2 | |
| 30 | 2 | |
| 34.56 | 1 | |
| 80 | 2 | |
| 80.32 | 1 |
| Value | Count | Frequency (%) |
| 1885 | 2 | |
| 1579.2 | 1 | |
| 1175 | 2 | |
| 1139.775 | 1 | |
| 1005 | 2 | |
| 955 | 2 | |
| 945 | 2 | |
| 935.22 | 1 | |
| 925.3 | 1 | |
| 890.055 | 1 |
semiDetached
| Distinct | 58 |
|---|---|
| Distinct (%) | 50.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 78.69745614 |
| Minimum | 0 |
|---|---|
| Maximum | 315 |
| Zeros | 7 |
| Zeros (%) | 6.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 15.015 |
| median | 42.76 |
| Q3 | 110 |
| 95-th percentile | 246.675 |
| Maximum | 315 |
| Range | 315 |
| Interquartile range (IQR) | 94.985 |
Descriptive statistics
| Standard deviation | 81.2290373 |
|---|---|
| Coefficient of variation (CV) | 1.032168526 |
| Kurtosis | 0.6974312602 |
| Mean | 78.69745614 |
| Median Absolute Deviation (MAD) | 32.6325 |
| Skewness | 1.262884145 |
| Sum | 8971.51 |
| Variance | 6598.1565 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 30 | 8 | 7.0% |
| 15 | 8 | 7.0% |
| 0 | 7 | 6.1% |
| 25 | 6 | 5.3% |
| 5 | 6 | 5.3% |
| 35 | 4 | 3.5% |
| 60 | 4 | 3.5% |
| 95 | 4 | 3.5% |
| 190 | 2 | 1.8% |
| 50 | 2 | 1.8% |
| Other values (48) | 63 |
| Value | Count | Frequency (%) |
| 0 | 7 | |
| 5 | 6 | |
| 9.495 | 1 | 0.9% |
| 9.75 | 1 | 0.9% |
| 10 | 2 | 1.8% |
| 10.255 | 1 | 0.9% |
| 10.305 | 1 | 0.9% |
| 14.9 | 1 | 0.9% |
| 14.96 | 1 | 0.9% |
| 15 | 8 |
| Value | Count | Frequency (%) |
| 315 | 2 | |
| 295 | 2 | |
| 260.13 | 1 | |
| 259.35 | 1 | |
| 239.85 | 1 | |
| 230 | 2 | |
| 215 | 2 | |
| 210 | 2 | |
| 190.08 | 1 | |
| 190 | 2 |
rowHouses
| Distinct | 55 |
|---|---|
| Distinct (%) | 48.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 94.27223684 |
| Minimum | 0 |
|---|---|
| Maximum | 455 |
| Zeros | 23 |
| Zeros (%) | 20.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 9.8125 |
| median | 39.95 |
| Q3 | 129.41 |
| 95-th percentile | 376.575 |
| Maximum | 455 |
| Range | 455 |
| Interquartile range (IQR) | 119.5975 |
Descriptive statistics
| Standard deviation | 125.8440418 |
|---|---|
| Coefficient of variation (CV) | 1.334900348 |
| Kurtosis | 1.285923317 |
| Mean | 94.27223684 |
| Median Absolute Deviation (MAD) | 39.95 |
| Skewness | 1.556324923 |
| Sum | 10747.035 |
| Variance | 15836.72285 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 23 | 20.2% |
| 15 | 6 | 5.3% |
| 5 | 4 | 3.5% |
| 25 | 4 | 3.5% |
| 55 | 4 | 3.5% |
| 10 | 4 | 3.5% |
| 375 | 2 | 1.8% |
| 45 | 2 | 1.8% |
| 70 | 2 | 1.8% |
| 65 | 2 | 1.8% |
| Other values (45) | 61 |
| Value | Count | Frequency (%) |
| 0 | 23 | |
| 5 | 4 | 3.5% |
| 9.495 | 1 | 0.9% |
| 9.75 | 1 | 0.9% |
| 10 | 4 | 3.5% |
| 10.11 | 1 | 0.9% |
| 10.26 | 1 | 0.9% |
| 15 | 6 | 5.3% |
| 15.48 | 1 | 0.9% |
| 20 | 2 | 1.8% |
| Value | Count | Frequency (%) |
| 455 | 2 | |
| 434.7 | 1 | |
| 430 | 2 | |
| 379.5 | 1 | |
| 375 | 2 | |
| 340 | 2 | |
| 335 | 2 | |
| 324.225 | 1 | |
| 299.98 | 1 | |
| 269.69 | 1 |
apartmentInDuplex
| Distinct | 67 |
|---|---|
| Distinct (%) | 58.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 126.1997368 |
| Minimum | 0 |
|---|---|
| Maximum | 555 |
| Zeros | 2 |
| Zeros (%) | 1.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 13.34975 |
| Q1 | 45 |
| median | 92.5 |
| Q3 | 175 |
| 95-th percentile | 338.66625 |
| Maximum | 555 |
| Range | 555 |
| Interquartile range (IQR) | 130 |
Descriptive statistics
| Standard deviation | 112.7504316 |
|---|---|
| Coefficient of variation (CV) | 0.8934284209 |
| Kurtosis | 3.914916875 |
| Mean | 126.1997368 |
| Median Absolute Deviation (MAD) | 52.5 |
| Skewness | 1.811055159 |
| Sum | 14386.77 |
| Variance | 12712.65983 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 6 | 5.3% |
| 45 | 6 | 5.3% |
| 75 | 4 | 3.5% |
| 40 | 4 | 3.5% |
| 210 | 4 | 3.5% |
| 175 | 4 | 3.5% |
| 15 | 4 | 3.5% |
| 10 | 2 | 1.8% |
| 105 | 2 | 1.8% |
| 390 | 2 | 1.8% |
| Other values (57) | 76 |
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 9.63 | 1 | 0.9% |
| 10 | 2 | |
| 10.675 | 1 | 0.9% |
| 14.79 | 1 | 0.9% |
| 15 | 4 | |
| 24.7 | 1 | 0.9% |
| 25 | 2 | |
| 30 | 2 | |
| 30.3 | 1 | 0.9% |
| Value | Count | Frequency (%) |
| 555 | 2 | |
| 534.025 | 1 | |
| 415.53 | 1 | |
| 390 | 2 | |
| 311.025 | 1 | |
| 310 | 2 | |
| 260 | 2 | |
| 250.515 | 1 | |
| 250.035 | 1 | |
| 240 | 2 |
apartmentLessThanFiveStories
Real number (ℝ≥0)
HIGH CORRELATION
| Distinct | 73 |
|---|---|
| Distinct (%) | 64.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 530.1420614 |
| Minimum | 45.24 |
|---|---|
| Maximum | 1010 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 45.24 |
|---|---|
| 5-th percentile | 125.9125 |
| Q1 | 300 |
| median | 507.8625 |
| Q3 | 720 |
| 95-th percentile | 986.285 |
| Maximum | 1010 |
| Range | 964.76 |
| Interquartile range (IQR) | 420 |
Descriptive statistics
| Standard deviation | 261.6592034 |
|---|---|
| Coefficient of variation (CV) | 0.493564315 |
| Kurtosis | -0.9152869322 |
| Mean | 530.1420614 |
| Median Absolute Deviation (MAD) | 207.8625 |
| Skewness | 0.121627664 |
| Sum | 60436.195 |
| Variance | 68465.53871 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 300 | 6 | 5.3% |
| 1000 | 4 | 3.5% |
| 250 | 2 | 1.8% |
| 695 | 2 | 1.8% |
| 895 | 2 | 1.8% |
| 1010 | 2 | 1.8% |
| 625 | 2 | 1.8% |
| 720 | 2 | 1.8% |
| 725 | 2 | 1.8% |
| 455 | 2 | 1.8% |
| Other values (63) | 88 |
| Value | Count | Frequency (%) |
| 45.24 | 1 | |
| 60 | 2 | |
| 90 | 2 | |
| 99.75 | 1 | |
| 140 | 2 | |
| 155.52 | 1 | |
| 160.5 | 1 | |
| 170 | 2 | |
| 230.4 | 1 | |
| 244.725 | 1 |
| Value | Count | Frequency (%) |
| 1010 | 2 | |
| 1000 | 4 | |
| 978.9 | 1 | 0.9% |
| 974.4 | 1 | 0.9% |
| 954.27 | 1 | 0.9% |
| 935.195 | 1 | 0.9% |
| 925 | 2 | |
| 920.01 | 1 | 0.9% |
| 895 | 2 | |
| 870 | 2 |
apartmentMoreThanFiveStories
Real number (ℝ≥0)
HIGH CORRELATION, ZEROS
| Distinct | 58 |
|---|---|
| Distinct (%) | 50.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 351.6162719 |
| Minimum | 0 |
|---|---|
| Maximum | 1580 |
| Zeros | 25 |
| Zeros (%) | 21.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 27.7625 |
| median | 255.015 |
| Q3 | 573.06 |
| 95-th percentile | 1110 |
| Maximum | 1580 |
| Range | 1580 |
| Interquartile range (IQR) | 545.2975 |
Descriptive statistics
| Standard deviation | 376.6559946 |
|---|---|
| Coefficient of variation (CV) | 1.071213208 |
| Kurtosis | 1.216025306 |
| Mean | 351.6162719 |
| Median Absolute Deviation (MAD) | 250.015 |
| Skewness | 1.289507502 |
| Sum | 40084.255 |
| Variance | 141869.7383 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 25 | 21.9% |
| 325 | 4 | 3.5% |
| 575 | 4 | 3.5% |
| 5 | 2 | 1.8% |
| 50 | 2 | 1.8% |
| 610 | 2 | 1.8% |
| 350 | 2 | 1.8% |
| 675 | 2 | 1.8% |
| 75 | 2 | 1.8% |
| 150 | 2 | 1.8% |
| Other values (48) | 67 |
| Value | Count | Frequency (%) |
| 0 | 25 | |
| 5 | 2 | 1.8% |
| 15.36 | 1 | 0.9% |
| 25.35 | 1 | 0.9% |
| 35 | 2 | 1.8% |
| 50 | 2 | 1.8% |
| 64.9 | 1 | 0.9% |
| 75 | 2 | 1.8% |
| 80.84 | 1 | 0.9% |
| 90 | 2 | 1.8% |
| Value | Count | Frequency (%) |
| 1580 | 2 | |
| 1370.46 | 1 | |
| 1180 | 2 | |
| 1110 | 2 | |
| 1080 | 2 | |
| 1064.45 | 1 | |
| 1009.47 | 1 | |
| 980.49 | 1 | |
| 899.87 | 1 | |
| 885 | 2 |
otherDwellings
| Distinct | 27 |
|---|---|
| Distinct (%) | 23.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.95241228 |
| Minimum | 0 |
|---|---|
| Maximum | 150 |
| Zeros | 49 |
| Zeros (%) | 43.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 5 |
| Q3 | 20 |
| 95-th percentile | 96.71375 |
| Maximum | 150 |
| Range | 150 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 34.32012456 |
|---|---|
| Coefficient of variation (CV) | 1.81085785 |
| Kurtosis | 5.668525736 |
| Mean | 18.95241228 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 2.45941641 |
| Sum | 2160.575 |
| Variance | 1177.87095 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 49 | |
| 5 | 18 | 15.8% |
| 20 | 8 | 7.0% |
| 10 | 4 | 3.5% |
| 75 | 4 | 3.5% |
| 35 | 4 | 3.5% |
| 15 | 4 | 3.5% |
| 65 | 2 | 1.8% |
| 120 | 2 | 1.8% |
| 150 | 2 | 1.8% |
| Other values (17) | 17 | 14.9% |
| Value | Count | Frequency (%) |
| 0 | 49 | |
| 5 | 18 | 15.8% |
| 9.425 | 1 | 0.9% |
| 9.495 | 1 | 0.9% |
| 9.6 | 1 | 0.9% |
| 9.825 | 1 | 0.9% |
| 10 | 4 | 3.5% |
| 10.04 | 1 | 0.9% |
| 10.08 | 1 | 0.9% |
| 10.675 | 1 | 0.9% |
| Value | Count | Frequency (%) |
| 150 | 2 | |
| 149.64 | 1 | 0.9% |
| 124.745 | 1 | 0.9% |
| 120 | 2 | |
| 84.175 | 1 | 0.9% |
| 75 | 4 | |
| 70.11 | 1 | 0.9% |
| 65 | 2 | |
| 60.72 | 1 | 0.9% |
| 54.81 | 1 | 0.9% |
averageNumRooms
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 37 |
|---|---|
| Distinct (%) | 33.6% |
| Missing | 4 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5.773636364 |
| Minimum | 3.5 |
|---|---|
| Maximum | 8.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 3.5 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 5.3 |
| median | 5.6 |
| Q3 | 6.5 |
| 95-th percentile | 7.365 |
| Maximum | 8.5 |
| Range | 5 |
| Interquartile range (IQR) | 1.2 |
Descriptive statistics
| Standard deviation | 1.04694201 |
|---|---|
| Coefficient of variation (CV) | 0.1813314771 |
| Kurtosis | -0.1708518958 |
| Mean | 5.773636364 |
| Median Absolute Deviation (MAD) | 0.7 |
| Skewness | 0.08962215487 |
| Sum | 635.1 |
| Variance | 1.096087573 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5.6 | 9 | 7.9% |
| 5.4 | 8 | 7.0% |
| 6.5 | 7 | 6.1% |
| 5.3 | 7 | 6.1% |
| 6.8 | 6 | 5.3% |
| 6 | 5 | 4.4% |
| 7.2 | 5 | 4.4% |
| 5.5 | 5 | 4.4% |
| 5.7 | 5 | 4.4% |
| 5.2 | 4 | 3.5% |
| Other values (27) | 49 | |
| (Missing) | 4 | 3.5% |
| Value | Count | Frequency (%) |
| 3.5 | 1 | 0.9% |
| 3.7 | 2 | |
| 3.9 | 2 | |
| 4 | 3 | |
| 4.1 | 1 | 0.9% |
| 4.2 | 1 | 0.9% |
| 4.3 | 1 | 0.9% |
| 4.4 | 3 | |
| 4.5 | 3 | |
| 4.7 | 2 |
| Value | Count | Frequency (%) |
| 8.5 | 1 | 0.9% |
| 8.2 | 2 | 1.8% |
| 7.7 | 1 | 0.9% |
| 7.5 | 2 | 1.8% |
| 7.2 | 5 | |
| 7.1 | 2 | 1.8% |
| 7 | 2 | 1.8% |
| 6.9 | 1 | 0.9% |
| 6.8 | 6 | |
| 6.7 | 1 | 0.9% |
averageValue
| Distinct | 74 |
|---|---|
| Distinct (%) | 67.3% |
| Missing | 4 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 297805.7182 |
| Minimum | 116910 |
|---|---|
| Maximum | 723201 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 116910 |
|---|---|
| 5-th percentile | 150899.15 |
| Q1 | 225183.25 |
| median | 270969 |
| Q3 | 343631.25 |
| 95-th percentile | 525779 |
| Maximum | 723201 |
| Range | 606291 |
| Interquartile range (IQR) | 118448 |
Descriptive statistics
| Standard deviation | 125633.1459 |
|---|---|
| Coefficient of variation (CV) | 0.4218627724 |
| Kurtosis | 2.992525713 |
| Mean | 297805.7182 |
| Median Absolute Deviation (MAD) | 62095.5 |
| Skewness | 1.567696826 |
| Sum | 32758629 |
| Variance | 1.578368735 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 151431 | 2 | 1.8% |
| 302476 | 2 | 1.8% |
| 340284 | 2 | 1.8% |
| 279612 | 2 | 1.8% |
| 155535 | 2 | 1.8% |
| 241261 | 2 | 1.8% |
| 402059 | 2 | 1.8% |
| 240784 | 2 | 1.8% |
| 304330 | 2 | 1.8% |
| 294316 | 2 | 1.8% |
| Other values (64) | 90 | |
| (Missing) | 4 | 3.5% |
| Value | Count | Frequency (%) |
| 116910 | 1 | |
| 120872 | 1 | |
| 136526 | 1 | |
| 142499 | 1 | |
| 148100 | 1 | |
| 150464 | 1 | |
| 151431 | 2 | |
| 155535 | 2 | |
| 175911 | 1 | |
| 178227 | 1 |
| Value | Count | Frequency (%) |
| 723201 | 2 | |
| 720989 | 2 | |
| 558133 | 1 | |
| 525779 | 2 | |
| 498673 | 2 | |
| 482609 | 2 | |
| 479064 | 1 | |
| 447079 | 1 | |
| 413990 | 1 | |
| 402595 | 2 |
medianHouseholdIncome
Real number (ℝ≥0)
HIGH CORRELATION, MISSING
| Distinct | 74 |
|---|---|
| Distinct (%) | 67.3% |
| Missing | 4 |
| Missing (%) | 3.5% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 51699.84545 |
| Minimum | 24784 |
|---|---|
| Maximum | 126261 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.0 KiB |
Quantile statistics
| Minimum | 24784 |
|---|---|
| 5-th percentile | 28665.5 |
| Q1 | 40858 |
| median | 47108 |
| Q3 | 60993.25 |
| 95-th percentile | 80018 |
| Maximum | 126261 |
| Range | 101477 |
| Interquartile range (IQR) | 20135.25 |
Descriptive statistics
| Standard deviation | 18090.55011 |
|---|---|
| Coefficient of variation (CV) | 0.3499149746 |
| Kurtosis | 5.08726132 |
| Mean | 51699.84545 |
| Median Absolute Deviation (MAD) | 8298 |
| Skewness | 1.788731864 |
| Sum | 5686983 |
| Variance | 327268003.3 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80018 | 2 | 1.8% |
| 49728 | 2 | 1.8% |
| 47053 | 2 | 1.8% |
| 54902 | 2 | 1.8% |
| 45218 | 2 | 1.8% |
| 61082 | 2 | 1.8% |
| 41594 | 2 | 1.8% |
| 64651 | 2 | 1.8% |
| 67383 | 2 | 1.8% |
| 42759 | 2 | 1.8% |
| Other values (64) | 90 | |
| (Missing) | 4 | 3.5% |
| Value | Count | Frequency (%) |
| 24784 | 1 | |
| 24800 | 1 | |
| 26722 | 1 | |
| 27846 | 1 | |
| 28382 | 2 | |
| 29012 | 1 | |
| 30793 | 2 | |
| 32870 | 2 | |
| 34298 | 1 | |
| 35431 | 1 |
| Value | Count | Frequency (%) |
| 126261 | 2 | |
| 117738 | 1 | |
| 84428 | 2 | |
| 80018 | 2 | |
| 74856 | 2 | |
| 74382 | 1 | |
| 72176 | 1 | |
| 72003 | 2 | |
| 70539 | 1 | |
| 67383 | 2 |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.
To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
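As an illustration of the rank-based definition, here is a minimal pure-Python sketch (not the code used to produce this report). It assumes tied values receive the average of their rank positions; the helper names `average_ranks`, `pearson` and `spearman` are hypothetical.

```python
from math import sqrt

def average_ranks(values):
    """Ranks from 1..n; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / (n - 1)
    sx = sqrt(sum((a - mx) ** 2 for a in x) / (n - 1))
    sy = sqrt(sum((b - my) ** 2 for b in y) / (n - 1))
    return cov / (sx * sy)

def spearman(x, y):
    """Pearson correlation applied to the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y = x**3.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman(x, y)   # ranks agree exactly
r = pearson(x, y)      # linear fit is imperfect
print(rho, r)
```

On this toy data the ranks of X and Y coincide, so ρ reaches its maximum while r stays below it, which is the behaviour the paragraph describes.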
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.
To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
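The formula and the invariance property can be sketched in a few lines of plain Python. This is an illustrative example on made-up numbers, not data from the report; `pearson_r` is a hypothetical helper, and the rescaling uses positive scale factors.

```python
from math import sqrt

def pearson_r(x, y):
    """Sample covariance of x and y over the product of their sample std devs."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y)) / (n - 1)
    sx = sqrt(sum((a - mx) ** 2 for a in x) / (n - 1))
    sy = sqrt(sum((b - my) ** 2 for b in y) / (n - 1))
    return cov / (sx * sy)

# Made-up, nearly linear data.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 7.8, 10.1]

r = pearson_r(x, y)
# Shift and rescale each variable separately (positive scale factors).
r_rescaled = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
# Negating one variable flips the sign of r but not its magnitude.
r_negated = pearson_r(x, [-b for b in y])
print(r, r_rescaled, r_negated)
```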
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.
To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
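The pair-counting definition translates directly into code. The sketch below implements exactly the formula stated above (often called tau-a) on made-up data; `kendall_tau_a` is a hypothetical helper. Statistical libraries commonly apply a tie correction (tau-b), so their results may differ when the data contain ties.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1      # both variables order the pair the same way
        elif sign < 0:
            discordant += 1      # the variables order the pair oppositely
        # sign == 0 means a tie in x or y; it counts toward neither group
    n = len(x)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Made-up data without ties: 7 concordant and 3 discordant pairs out of 10.
x = [1, 2, 3, 4, 5]
y = [3, 1, 2, 5, 4]
tau = kendall_tau_a(x, y)
tau_reversed = kendall_tau_a(x, [5, 4, 3, 2, 1])
print(tau, tau_reversed)
```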
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
First rows
| | df_index | occupiedDwellings | singleDetached | semiDetached | rowHouses | apartmentInDuplex | apartmentLessThanFiveStories | apartmentMoreThanFiveStories | otherDwellings | averageNumRooms | averageValue | medianHouseholdIncome | medianRent |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2385 | 720.270 | 81.090 | 131.175 | 31.005 | 810.900 | 615.330 | 0.000 | 5.9 | 257003.0 | 46273.0 | 701 |
| 1 | 1 | 2045 | 449.900 | 169.735 | 130.880 | 85.890 | 615.545 | 474.440 | 124.745 | 5.8 | 179541.0 | 40235.0 | 650 |
| 2 | 2 | 2565 | 890.055 | 164.160 | 10.260 | 415.530 | 705.375 | 379.620 | 0.000 | 5.7 | 178227.0 | 41043.0 | 665 |
| 3 | 3 | 975 | 629.850 | 9.750 | 9.750 | 60.450 | 244.725 | 25.350 | 0.000 | 7.2 | 344747.0 | 61874.0 | 647 |
| 4 | 4 | 1320 | 545.160 | 190.080 | 25.080 | 89.760 | 324.720 | 145.200 | 0.000 | 6.7 | 242734.0 | 62928.0 | 722 |
| 5 | 5 | 2830 | 783.910 | 79.240 | 299.980 | 234.890 | 749.950 | 679.200 | 0.000 | 5.7 | 187471.0 | 47163.0 | 685 |
| 6 | 6 | 1965 | 575.745 | 35.370 | 324.225 | 135.585 | 520.725 | 379.245 | 9.825 | 5.6 | 211287.0 | 35702.0 | 703 |
| 7 | 7 | 1920 | 764.160 | 30.720 | 0.000 | 115.200 | 430.080 | 574.080 | 0.000 | 5.5 | 262096.0 | 38319.0 | 683 |
| 8 | 8 | 2300 | 354.200 | 140.300 | 434.700 | 220.800 | 749.800 | 391.000 | 0.000 | 5.3 | 136526.0 | 34298.0 | 637 |
| 9 | 9 | 1440 | 750.240 | 149.760 | 234.720 | 145.440 | 155.520 | 0.000 | 10.080 | 6.8 | 148100.0 | 41323.0 | 475 |
Last rows
| | df_index | occupiedDwellings | singleDetached | semiDetached | rowHouses | apartmentInDuplex | apartmentLessThanFiveStories | apartmentMoreThanFiveStories | otherDwellings | averageNumRooms | averageValue | medianHouseholdIncome | medianRent |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 104 | 28 | 1695 | 750.0 | 40.0 | 5.0 | 130.0 | 460.0 | 295.0 | 20.0 | 5.4 | 241927.0 | 44958.0 | 672.0 |
| 105 | 29 | 2075 | 945.0 | 25.0 | 60.0 | 25.0 | 445.0 | 575.0 | 0.0 | 6.5 | 293761.0 | 61082.0 | 1011.0 |
| 106 | 30 | 2180 | 600.0 | 70.0 | 70.0 | 205.0 | 1010.0 | 215.0 | 10.0 | 5.4 | 270969.0 | 44621.0 | 740.0 |
| 107 | 31 | 1730 | 470.0 | 110.0 | 25.0 | 125.0 | 700.0 | 150.0 | 150.0 | 5.3 | 206460.0 | 41594.0 | 750.0 |
| 108 | 32 | 1990 | 880.0 | 315.0 | 85.0 | 90.0 | 625.0 | 0.0 | 0.0 | 6.5 | 239150.0 | 60602.0 | 683.0 |
| 109 | 33 | 1270 | 525.0 | 230.0 | 55.0 | 90.0 | 295.0 | 0.0 | 75.0 | 6.1 | 155535.0 | 49728.0 | 706.0 |
| 110 | 34 | 900 | 700.0 | 65.0 | 0.0 | 10.0 | 60.0 | 5.0 | 65.0 | 7.2 | 192841.0 | 64651.0 | 734.0 |
| 111 | 35 | 3265 | 1885.0 | 100.0 | 375.0 | 45.0 | 525.0 | 325.0 | 0.0 | 7.5 | 302476.0 | 84428.0 | 1309.0 |
| 112 | 36 | 1535 | 20.0 | 15.0 | 10.0 | 15.0 | 400.0 | 1080.0 | 0.0 | NaN | NaN | NaN | NaN |
| 113 | 37 | 2160 | 15.0 | 5.0 | 20.0 | 15.0 | 1000.0 | 1110.0 | 0.0 | NaN | NaN | NaN | NaN |
Most frequently occurring
| | df_index | occupiedDwellings | singleDetached | semiDetached | rowHouses | apartmentInDuplex | apartmentLessThanFiveStories | apartmentMoreThanFiveStories | otherDwellings | averageNumRooms | averageValue | medianHouseholdIncome | # duplicates |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 0 | 2430 | 700.0 | 95.0 | 145.0 | 45.0 | 870.0 | 575.0 | 0.0 | 5.8 | 283020.0 | 45134.0 | 2 |
| 1 | 1 | 2095 | 455.0 | 175.0 | 135.0 | 75.0 | 650.0 | 480.0 | 120.0 | 5.7 | 230158.0 | 41444.0 | 2 |
| 2 | 2 | 3045 | 1005.0 | 210.0 | 25.0 | 390.0 | 735.0 | 675.0 | 0.0 | 5.6 | 232374.0 | 48312.0 | 2 |
| 3 | 3 | 1155 | 730.0 | 30.0 | 0.0 | 55.0 | 300.0 | 35.0 | 0.0 | 7.2 | 402595.0 | 74856.0 | 2 |
| 4 | 4 | 1455 | 615.0 | 215.0 | 40.0 | 90.0 | 300.0 | 195.0 | 0.0 | 6.6 | 290299.0 | 65536.0 | 2 |
| 5 | 5 | 2885 | 820.0 | 75.0 | 340.0 | 260.0 | 710.0 | 670.0 | 5.0 | 5.6 | 253840.0 | 56099.0 | 2 |
| 6 | 6 | 1945 | 525.0 | 15.0 | 335.0 | 160.0 | 535.0 | 370.0 | 5.0 | 5.4 | 240784.0 | 36344.0 | 2 |
| 7 | 7 | 1900 | 795.0 | 30.0 | 0.0 | 85.0 | 415.0 | 570.0 | 0.0 | 5.5 | 340284.0 | 46551.0 | 2 |
| 8 | 8 | 2350 | 380.0 | 190.0 | 430.0 | 175.0 | 725.0 | 445.0 | 5.0 | 5.2 | 184205.0 | 40858.0 | 2 |
| 9 | 9 | 1495 | 820.0 | 155.0 | 220.0 | 120.0 | 170.0 | 0.0 | 5.0 | 6.8 | 241261.0 | 50193.0 | 2 |